A Binaural Model for Missing Data Speech Recognition in Noisy and Reverberant Conditions

نویسندگان

Kalle J. Palomäki

Guy J. Brown

DeLiang Wang

چکیده

We describe a binaural auditory model for speech recognition, which is robust in the presence of reverberation and spatially separated noise intrusions. The principle underlying the model is to identify time-frequency regions which constitute reliable evidence of the speech signal. This is achieved both by determining the spatial location of the speech source, and by applying a simple model of reverberation masking. Reliable time-frequency regions are passed to a missing data speech recogniser. We show, firstly, that the auditory model improves recognition performance in various reverberation conditions when no noise intrusion is present. Secondly, we demonstrate that the model improves performance when the speech signal is contaminated by noise, both for an anechoic environment and in the presence of simulated room reverberation.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A binaural processor for missing data speech recognition in the presence of noise and small-room reverberation

In this study we describe a binaural auditory model for recognition of speech in the presence of spatially separated noise intrusions, under small-room reverberation conditions. The principle underlying the model is to identify time–frequency regions which constitute reliable evidence of the speech signal. This is achieved both by determining the spatial location of the speech source, and by gr...

متن کامل

Mask estimation and sparse imputation for missing data speech recognition in multisource reverberant environments

This work presents an automatic speech recognition system which uses a missing data approach to compensate for environmental noise. The missing, noise-corrupted components are identified using binaural features or a support vector machine (SVM) classifier. To perform speech recognition using the partially observed data, the missing components are substituted with clean speech estimates calculat...

متن کامل

Adaptive beamforming and soft missing data decoding for robust speech recognition in reverberant environments

This paper presents a novel approach to combine microphone array processing and robust speech recognition for reverberant multi-speaker environments. Spatial cues are extracted from a microphone array and automatically clustered to estimate localization masks in the time-frequency domain. The localization masks are then used to blindly design adaptive filters in order to enhance the source sign...

متن کامل

A hearing-inspired approach for distant-microphone speech recognition in the presence of multiple sources

This paper addresses the problem of speech recognition in reverberant multisource noise conditions using distant binaural microphones. Our scheme employs a two-stage fragment decoding approach inspired by Bregman’s account of auditory scene analysis, in which innate primitive grouping ‘rules’ are balanced by the role of learnt schema-driven processes. First, the acoustic mixture is split into l...

متن کامل

Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions

Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2001

A Binaural Model for Missing Data Speech Recognition in Noisy and Reverberant Conditions

نویسندگان

چکیده

منابع مشابه

A binaural processor for missing data speech recognition in the presence of noise and small-room reverberation

Mask estimation and sparse imputation for missing data speech recognition in multisource reverberant environments

Adaptive beamforming and soft missing data decoding for robust speech recognition in reverberant environments

A hearing-inspired approach for distant-microphone speech recognition in the presence of multiple sources

Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions

عنوان ژورنال:

اشتراک گذاری